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Abstract — The problem addressed in this paper is the analysis 
of a distributed consensus algorithm for arbitrary networks, 
proposed by Benezit et al.. In the initial setting, each node in the 
network has one of two possible states ("yes" or "no"). Nodes can 
update their states by communicating with their neighbors via 
a 2-bit message in an asynchronous clock setting. Eventually, all 
nodes reach consensus on the majority states. We use the theory 
of electric networks, random walks, and couplings of Markov 
chains to derive an 0(N 4 log N) upper bound for the expected 
convergence time on an arbitrary graph of size N. 

Index Terms — Distributed binary consensus, gossip, conver- 
gence time 

I. Introduction 

The problem of distributed and quantized average consen- 
sus on networks has received considerable attention recently 
(DEI 1 3 1. It models averaging in a network with finite capacity 
channels ]3). It is of interest in the context of coordination of 
autonomous agents, estimation, and distributed data fusion on 
sensor networks, peer-to-peer systems, etc. HI (4). A special 
case of distributed averaging is the distributed binary voting 
problem, where all the nodes in the network vote "yes" or 
"no" and a majority opinion is desired OH. 

Distributed algorithms requiring limited communication and 
computation are specially appealing to remote and extreme 
environments. An example to motivate this problem is sensor 
decision-making in a network. For example, as shown in Fig. 
[T] sensors are deployed to measure if an earthquake is happen- 
ing or not in a certain region. The limited communication only 
happens between sensors. Each sensor has limited battery. An 
algorithm with good convergence speed is desired. 

This work is motivated by the distributed binary voting 
algorithm proposed by Benezit et al. Q. They showed that 
the algorithm reaches consensus on a quantized interval that 
contains the average almost surely. However, they did not 
analyze the convergence time. In JS), the authors studied the 
convergence speed in the special case of regular graphs for 
a similar distributed binary consensus algorithm. Draief et al. 
derived an expected convergence time bound depending on the 
second largest eigenvalue of a doubly stochastic matrix charac- 
terizing the algorithm and voting margin (4), instantiating the 
bound with some particular networks, yet no specific bound 
is provided for an arbitrary graph. In this paper, we derive 
an 0(N 4 logN) upper bound for the convergence speed of 
the distributed algorithm for arbitrary connected graphs using 




Fig. 1. Sensor nodes deployed in decision-making. 

results on electric networks, random walks, and Markov chain 
coupling . 

The remainder of this paper is organized as follows. Section 
2 describes the algorithm proposed in and formulates 
the convergence speed problem. In Section 3, we derive our 
polynomial bound for this algorithm. In Section 4, we give a 
simple example on how to derive an upper bound given the 
topology of the network, simulation results are provided to 
justify the analysis. We provide our conclusions in Section 5. 

II. Problem Statement 

A network is represented by a connected graph Q — (V,£ ), 
where V = {1, 2, ...,N} is the set of nodes and £ is the set 
of edges. £ £ if nodes i,j can communicate with each 
other. Mi is the set of neighbors of node i. 

Consider a network of N nodes, labeled 1 through N. 
As proposed in [1|[3|[5|, each node has a clock which ticks 
according to a rate 1 exponential distribution. By the superpo- 
sition property for the exponential distribution, this set up is 
equivalent to a single global clock with a rate N exponential 
distribution ticking at times {Zk}k>o- The communication and 
update of states only occur at {-ZfclfcX)- When the clock of 
node i ticks, i randomly chooses a neighbor j from the set Mi. 
We say edge is activated. Let S^'(t) denote the state of 
node i at time t. S®(t) G S~ , W + , W~}, representing 
strong positive, strong negative, weak positive, and weak 
negative respectively, where S — ±2 and W ± = ±1. 
The two nodes i,j then update their states according to the 
following update rules: 



st — I + Tt 

w*- 1 rs — " lJ 

w- ~f — : , ~~ 

s- 1 ' ^ 

Fig. 2. Update rules for distributed binary vo(e(5)- The figure shows update 
principles: when opposite "strong opinion" s meet, they both turn into "weak 
opinion" s; "strong opinion" affects "weak opinion"; and swap principle. 

1) If SW(t) = S<S)(t), 

s«(t + i) =sU){t + i) = ffW(*); 

2) If > and • < 0, 

SW(t+l) = -£<*>(t), SW5(*+l) = S w (t), and vice 
versa; 

3) If \S®{t)\ > \SW(t)\ and S«(t) ■ S«(t) > 0, 
fiW(i + 1) = S^t), 5W(t + 1) = SW(t), and vice 
versa; 

4) If = -SW>(t), 
S®(t + l) = sign(sW(t)) , 
SU>(t+i) = sign (S^(t)) . 

Definition 1 (Convergence). A binary voting reaches conver- 
gence if all states of nodes on the graph are positive or all 
states are negative. 

We show the update rules in Fig. [2] Note that this algorithm 
supposes that there is an odd number of nodes in the network, 
in order to guarantee convergence regardless of initial votings 
of nodes. 

Let | S + 1 denote the number of the Strong Positive opinions 
and |S ,+ (i)| denote the number of the Strong Positive opinions 
at time t. A quick validation of the convergence of the 
algorithm in Section [IT] we notice that the S + and S~ will 
only annihilate each other when they meet, otherwise they just 
take random walks on the graph. So only the maj oritysf rong 
opinions will be left on the graph in the end. We also notice 
that strong opinions can influence weak opinions as shown in 
Fig. [2] Eventually all agents will take the sign of the majority 
strong opinions. Because the graph has finite vertex, and this 
Markov chain has finite states, thus convergence will happen 
in finite time almost surely. In this paper, we are interested in 
studying the convergence time of this distributed algorithm on 
an arbitrary graph. 

III. An Upper Bound on the Convergence Time for 
Distributed Binary Consensus 

The main result of this work is the following theorem: 

Theorem 1. The upper bound for the binary voting algorithm 
is C(iV 4 log(iV)). 

We use the analogy of electric networks and random walks 
to derive the upper bound. Before we proceed to the formal 
proof, it is helpful to give some definitions of random walk 
and provide the notation we will use in the proof. 



A. Definition and Notation 

Definition 2 (Hitting Time). For a graph Q, let H(i, j) denote 
the expected number of steps a random walk beginning at i 
must take before reaching j. Define the "hitting time " of Q by 
H(Q) = maxjj H(i,j). 

Definition 3 (Meeting Time). Consider two random walkers 
are placed on Q. At each tick of the clock, they move according 
to some joint probability distribution. Let A4(i,j) denote the 
expected time that the two walkers meet at the same node or 
they cross each other through the same edge ( if they move 
at the same time). Define "meeting time" of Q by M(Q) = 
maxjj M.(i, j). 

Define a simple random walk on Q, X$, with transition 
matrix P s = (Pij): 
. Pi~0 for Vi € V, 

• P ij : = WT\ for (^') e £ - 

Mi is the set of neighbors of node i and \M%\ is the degree of 
node i. 

Define a natural random walk with transition matrix 
P N = (Pij): 

. Pg = 1 - i for V^ e V, 

• P il = wm for(i,j)££. 

Define a biased random walk Xb with transition matrix 

P B = {Pa): 

. i^:=l-^-£ feeM ^forWGV, 

• ^^Trim + m) for (*>i) ef - 

B. The Analogy 

Before two opposite strong opinions m, n meet each other, 
they take random walks on the graph Q. Their marginal 
transition matrices are both P B . It may be tempting to think 
that they are taking natural random walk as stated in [2| for 
a similar problem, but after a second close look, we find that 
there are two sources stimulating the random walk from i to 
j, V(i,j) € £: one is the clock of the node i, P^, — P^; 
the other one is the clock of its neighbor j, Pfj = Pi\ . Thus 
Pij = Pij + Pfj, i.e., the transitional matrix is actually P B 
instead of P . Denote this random process as X. Since the 
two random walks m, n cannot move at the same time, if they 
are not adjacent. Suppose m is at node x, and n is at node y. 

For x ^ My, and i £ M x , we have 

-P^joint( m moves from x to i, n does not move) 
= P B (m) — -P;tjoint(m moves from x to i, n moves) 
= P*i- (1) 

Similar for Pxjomt{n moves from y to j, m does not move). 
Also, 

-P^joint(™ does not move, n does not move) 



For x G N y and i^ywe have, 

Px\omt{ m moves from x to i, n does not move) 
= Pxi(m) — -P;tjomt(m moves from x to i, n moves) 



= P L 



(3) 



Pxiomt{rn moves to y, n moves to x) = P B y (4) 
Pxiomtim does not move, n does not move) 

= i - p b - p b + p b (5) 

C. Meeting Time on Weighted Graph 

Lemma 1. The biased random walk Xb is a reversible Markov 
process. 

Proof: A Markov process is said to reversible if the 
transition probability, P, between each pair of states i and 
j in the state space obey HiPij = TtjPji- 

Let 7r be the stationary distribution of Xb- It is easy to get 



1 

N 



(6) 



for all i 6 V. Thus 



W . P B =7T . P B 

1 ij ~ 3 31 ' 



Lemma 2. H P B(x,y) + T-LpB(y,z) + % p b{z,x) = 
V. p b (x, z) +% p b (z, y) + H p b (y, x). 

Proof: A direct result from Lemma 2 in Chap 3 of 

Aldous-Fill's book [7| since Xb is reversible. ■ 



Theorem 2. U p b (Q) < 



N 



Proof: The biased random walk Xb defined above is a 
random walk on a weighted graph with edge weight 



1 f 1 



U'i 



1 



N VIM | IM| 



for (i,j)e£. (7) 



wu := 1 - ^ Wij. 



Wi = Wij = 1, w 
jev 



E 



Wi = N. 



(8) 



(9) 



It is well-known that there is an analogy between a weighted 
graph and an electric network, where a wire linking i and j 
has conductance Wij, i.e., resistance [7|[8|. And they 

have the following relationship 



UpB(x,y) +H P B(y,x) = wr' xy , (10) 

where r' xy is the effective resistance in the electric network 
between node x and node y. Since the degree of any node is 



at most N 



1, for G £, 

1/1 



w, 



> 



> 



N 
1 



i 



i 



N K N- 



1 



Thus Tij < 



N N N 
And consequently, r! 



N- V 
2 



(11) 



< rij < ns-, indicating 



N_ 

2 . . . y 

that the effective resistance between any two neighboring 

T 2 



nodes is less than N /2. 



For Vx, y G V, 



< (N-l) 



< 



JV3 

2 



The worst case 



is TV — 1 resistors connected in serial. Connecting any more 
resistors between any two nodes will only decrease the total 



resistance. By Equation (10 1, we have 



H pB (x,y) < 



H P B(x,y) +H P B(y,x) 
wr' 



< 



TV- 
~2~ 



xy 

N 3 
~2~ 



(12) 



This completes the proof. ■ 
Note that this is an upper bound for arbitrary connected 
graphs. A tighter bound can be derived for given topology of 
networks. A simple example of star networks will be given in 
the next section. 

Definition 4 (Hidden Vertex). A vertex t in a graph is said 
to be hidden if for every other point in the graph, %{t,v) < 
T-L(v,t). A hidden vertex is shown to exist for all reversible 
Markov chains in /|9y. 

Theorem 3. The meeting time of any two opinions on the 
network Q is less than A%pb{Q). 

Proof: In order to prove the theorem, we construct a 
coupling Markov chain, X' to assist the analysis. X' has the 
same joint distribution as X in Section |HI-B| except Equation 
Q and Equation([5]l. 



Px'joini(m, n meet at x or y) = 2Pj 



(13) 



Px 'joint (m does not move, n does not move) 

= i X /'/: X r!-. (14) 



jesV v 



First, we show that the meeting time of two random walkers 
following X' is less than 2T-L p b(Q). 

For convenience, we adopt the convention of the following 
notation: if /(•) is a real valued function on the vertex of 
the graph, then f(v) is the weighted average of f(u) over all 
neighbors u of v. 

Similar as in [2 |[9], define a potential function 

<f>(x, y) := U p b (x, y) + H p b (y, t) - U p b (t, y), (15) 

where t is a hidden vertex on the graph. By Lemma [2] (f>(x, y) 
is symmetric, i.e. <fi(x,y) — <fi(y,x). By the definition of 



meeting time, A4 is also symmetric, i.e. A4(x,y) = A4(y,x). i.e., 

This gives us intuition that we may be able to use 4> to help i 
us bound the meeting time. M x > (x,y) = — ^ R + M x >(x,y) 

By the definition of hitting time, for x ^ y we have 



ieW x 

n P s(x,y) J2 P^{M x -{x,y)-M x .{x,y)) 



l + P x 3 x H pB (x,y) + E Pin P u(i,y) 



V p 1 



l + w xx n pB (x,y)+ J2 w xl Hps(i,y), (16) ieAT. 



i.e., 



1 , Eie7V x w x ,H P B(i,y) 



< ^ +M X '(x,y). (24) 

i&A/i, 



H P B(x,y) = h 31 — Now we claim that M x >(x, y) < 4>{x,y). Suppose it is not 

^2ieAf x Wxi SieAA x w xi the case. Let /3 — m&x x>y {M. x >(x, y) — y)}. Among all 

1 hHfa; ) (17) ^ e P a * rs x > y rea li zm S P< choose any pair. It is clear that 

Z~2i^Af Wxi ' x V' since A4 X '(x, x) — < 4>{x,x). By Equation (18 1 

and Equation p4| ), 

So for x 7^ y, 

M x >(x,y) = 

I \ > ^ + M x >(x,y) 

= 1 + 1 - ^ P% - ^ \ M x > (x, y) > A^,(x,y). (25) 

\ ieAT x jeJV y / 

pBj^ /• \ Contradiction. Thus M^'(^) < 4>[x,y) < 2% p b{Q). 

x% x ' Now we are ready to proof theorem [3] Compare the joint 

distribution of A" and A"', we notice that the two two- 

+ -Pyj-M x >(x,j). (19) dimensional Markov chains are coupled until two random 

j^Ny walkers meet, because half of the time when the two random 

Note that Equation (15) also holds for x e M y . We now have walkers in X ' meet ' the random walkers in X do not ' but sta y 
' — ' in the same position. We claim that M. X {Q) < 2A4 X >(G). 

)In the random process X' , when two random walkers m, n 
M X '(x,y) meet, instead of finishing the process, we let them cross and 

keep doing random walk according to P^'joint- The expected 
length of each cross is less than or equal to M x ugy At each 
cross, the random process X finishes with a probability of 1/2, 
independently. Thus for any x,y 6 V we have 



1+ J2 P*iM x ,{i,y) + E Py 3 M x ,{x, 3 ).(20) 



Equation ( 20 » shows that at least one of the two inequality oo 

below holds: M x (x,y) < ]T ( - ) %M X '(S) = 2Mx>(G). (26) 



i=l 



M x' {x, y) > ^ 75b = M x' V) (2 V This completes the proof. 



J2ie\r P£M x >(x,j) 

M x >(x, y) > — - — - — — n = M. X i(x,y) (22) Without loss of generality, let us suppose that in the initial 

SieAA,, Pyi settine. more nodes hold strons nositive oninions (S + ). As 



D. Convergence Speed Analysis 

Without loss of generality, let 

jeAfy r yj setting, more nodes hold strong positive opinions (S + ). As 

...... „ .. , „ fTTTi , ,j briefly analyzed in Section IIII-AI the process undergoes two 

Without loss of generality, suppose that Equation (12211 holds , , , • r n- — , , , • r „ 

, . " „ "-pr 1 stages: the depletion of S and the depletion of W .By our 

(otherwise, we can prove the other way round). From Equation . 

m, we have assumption, 

^ |^ + (0)| > |5"(0)| (27) 



J2 P£M x >(x,y) = 1 + E P£M x >(i,y) 

E PyjM x/ (x,j) - E P%M x .{x,y). (23) 
jeNy j'eATj, where N is the number of nodes on the graph 



and 

|S+(0)| + |S-(0)|=AT, (28) 



According to the update rules in Section |TTJ we have 

\S + (t)\-\S-(t)\ = \S+(0)\-\S-(0)\. (29) 

Let T\ and T2 denote the maximum expected time it takes 
for Stage 1 and Stage 2 to finish. In the first stage, two opposite 
strong opinions annihilate when an edge between them is 
activated. Otherwise they take biased random walk on the 
graph Q. In the second stage, the remaining |S' + (0)| — |5 _ (0)| 
Strong Positives take random walks over graph Q, transforming 
Weak Negative into Weak Positive. Let CTg(v) denote the 
expected time for a random walker starting from node v to 
meet all other random walkers who are also taking random 
walks on the same graph but starting from different nodes. 
Define 



CT(G) 



max CTg{v). 



Corollary 1. 



CT{G) =0(N 4 log N). 



(30) 



Proof: In terms of the global clock ticks {Zk}k>o m our 
setting, by Theorem [2] and Theorem [3] we have 



Mia) < m. 



< A 4 



(31) 



|iV 5 is available since 



(S) ^ z «(S,P f 

Thus a union bound for CT(Q) is 
there are no more than ^ consecutive meetings. 

In order to obtain a tighter bound for CT(Q), we divide the 
random walks into In A periods of length kM.(G) each, where 
k is a constant. Let a be the "special" random walker trying 
to meet all other random walkers. For any period i and any 
other random walker v, by the Markov inequality, we have 



< 



so 



Pr(a does not meet v during period i) 

kM X '{Q) 
1 

k 



Pr(a does not meet v during any period) 

In TV 



< 



= N~ 



Ink 



(32) 



(33) 



If we take the union bound, 

Pr(a doesn't meet some walker during any period) 
< N-N~ Uk . (34) 

Conditioning on whether or not the walker a has met 
all other walkers after all kM. (<?) In A steps, and using the 
previous 0(N 5 ) upper bound, we have 



,1 



CT{G) < kM{g)]nN + A ■ N~ lnk ^N 5 



= kM(g)lnN+^N' 



6— In k 



(35) 



When k is sufficiently large, say k > e 6 , the second term 
is small. So 

CT{Q) < cA 4 In A. (36) 




Fig. 3. A star network. 



This completes the proof. 



Corollary 2. 



Stage 1: T x < 2CT{G) 



(37) 
(38) 



Stage 2: T 2 < CT(G) 

Proof: In order to prove p7| ), we can construct a coupling 
process of stage 1 . The coupling process is when two different 
strong opinions meet, instead of following the rules to change 
into weak opinions, they just keep their states and keep moving 
along the same path they would have as weak opinions. This 
process is over when every strong opinion has met all other 
opposite strong opinions, by when Stage 1 must have finished, 
i.e. before at most N 2 /4 such meetings. The rest of the proof 
just follows from Corollary [T] except we divide the random 
walks into ln(iV 2 ) periods of length kM(Q) instead of InN. 
Equation p8) follows from the fact that there are at most 
N — 1 meetings which a single strong opinion meets all the 
weak opinions to make sure convergence. ■ 
Theorem [T] is then a direct result from Corollary [T] and 
Corollary [2] 

IV. Simulation Results 
In this section, we give a simple example of star networks 



in order to show how to use the analysis in Section III for 
the particular graph with known topology. Simulation results 
are provided to validate the analysis. We also simulate the 
distributed process on Erdos-Renyi random graph in order to 
get an insight on how the algorithm performs on a random 
graph. 

A. Star Networks 

Star networks are one of the most common network topolo- 
gies. A star network S of N nodes has one central hub and 
N — 1 leaf nodes, as shown in Fig. [3] Now let us derive an 



upper bound following the similar analysis in Section III 



1 ) Analysis: By Equation (|7]i in Section III-C suppose that 
there is a star network of nodes, with the central hub 
denoted as c. For ^ c, we have 

1 N 1 (39, 



It',. 



N — 1 



N - 1 



The equivalent resistance between any two leaf nodes i and j 
is 



1 



1 



VJC 



2N - 2. 



(40) 



50 100 150 200 250 300 350 400 450 500 
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N 



Fig. 4. Average convergence time (green squares) versus the size of the star 
network. The blue solid line indicate 0.63iV 2 log N. 



Fig. 5 . Average convergence time versus the size of the Erdds-Renyi random 
graph. The blue solid line indicates 2N 2 log N, and the red dash line 
indicates 2.3N 2 log TV. 



By the symmetry of the star network, it is easy to see that 

H pB (i,j)=H P B(j,i). (41) 

By Equation ( [T0| , 

H pB (S)=H pB (i,j)=N(N-l). (42) 

Then following Theorem 2 and similar analysis of Corollary 1, 
2 and 3, we can bound the convergence time of a star network 
by 0(/V 2 logiV). 

2) Simulations: In order to justify the bound we derived in 
Section [rV-Al| we did simulations on star networks with nodes 
N ranging from 21 to 481, with an interval of 20. Initially, 
there are ceil(N) strong positive and floor(N) strong nega- 
tive nodes, i.e., — \S~\ — 1. Those nodes communicate 
with each other following the protocol in Section [TT] The 
process finishes when consensus is reached. Convergence time 
is the average of 20 rounds of simulations. 

We plot the average convergence time (green squares) versus 
the size of star networks in Fig. |4] As in the figure indicated 
by the solid blue line, 0{N 2 log N) is indeed an upper bound. 
Fig. [4] also indicates that this bound is tight for a star network. 

B. Erdds-Renyi random graph 

In a Erdos-Renyi random graph 1Z, an edge is set between 
each pair of nodes independently with equal probability p. As 
one of the properties of Erdos-Renyi random graphs, when 
p > lr° sJV > the g ra ph will almost surely be connected 

E3. 

^(number of edges) = 0.5N(N - l)p. 



The diameter of Erdos-Renyi random graphs is rather sensitive 
to small changes in the graph, but the typical distance between 
two random nodes on the graph is d = ^ 01 01 . 

We created Erdos-Renyi random graph by setting p = 
5 log N/N, where N ranged from 21 to 481, with an interval 



the figure, the expected convergence time is in the order of 

iV 2 logiV. 

V. CONCLUSIONS 

In this paper, we use the theory of electric network, random 
walks, and couplings of Markov chains to derive a polynomial 
bound on convergence time with respect to the size of the net- 
work, for the distributed binary consensus problem addressed 
in 0. It can be extended to a tighter bound for any given 
topology of network using the effective resistance analogy. 
Our result provides insights of the performance of the binary 
consensus algorithm, and with applications in sensor networks, 
distributed computing, peer-to-peer systems, etc.. 



of 20. Other settings are the same as in Section IV-A2 We 
plot the average convergence time (green squares) versus the 
size of Erdos-Renyi random graph in Fig. [5] As indicated in 



In the analysis in Section III we notice that if some sensors 
do not have an observation, we can randomly assign weak 
opinions to them. Doing this does not change the consensus 
result if there is a majority opinion at the beginning. 
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